Mining Market Basket Data Using Share Measures and Characterized Itemsets
Abstract
We propose the share-confidence framework for knowledge discovery from databases, which addresses the problem of mining itemsets from market basket data. Our goal is two-fold: (1) to present new itemset measures that are practical and useful alternatives to the commonly used support measure; and (2) to discover not only the buying patterns of customers but also customer profiles, by partitioning customers into distinct classes. We present a new algorithm for classifying itemsets based upon characteristic attributes extracted from census or lifestyle data. Our algorithm combines the Apriori algorithm for discovering association rules between items in large databases with the AOG algorithm for attribute-oriented generalization in large databases. We suggest how characterized itemsets can be generalized according to concept hierarchies associated with the characteristic attributes. Finally, we present experimental results that demonstrate the utility of the share-confidence framework.
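To make the contrast between support and a share-style measure concrete, the following is a minimal sketch. The abstract does not define its share measures, so the definition used here is an assumption: the quantity of an itemset's items bought in the transactions that contain the whole itemset, divided by the total quantity bought across all transactions. The names `support`, `quantity_share`, and the sample `baskets` are hypothetical and chosen only for illustration.

```python
from typing import Dict, FrozenSet, List

# A transaction maps each purchased item to the quantity bought (assumed format).
Transaction = Dict[str, int]


def contains(itemset: FrozenSet[str], transaction: Transaction) -> bool:
    """True if every item of the itemset appears in the transaction."""
    return all(item in transaction for item in itemset)


def support(itemset: FrozenSet[str], transactions: List[Transaction]) -> float:
    """Fraction of transactions that contain the whole itemset."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if contains(itemset, t))
    return hits / len(transactions)


def quantity_share(itemset: FrozenSet[str], transactions: List[Transaction]) -> float:
    """Assumed share-style measure (not taken from the paper): quantity of the
    itemset's items in transactions containing the itemset, over the total
    quantity of all items in all transactions."""
    total = sum(sum(t.values()) for t in transactions)
    if total == 0:
        return 0.0
    local = sum(
        sum(t[item] for item in itemset)
        for t in transactions
        if contains(itemset, t)
    )
    return local / total


# Hypothetical sample data: four baskets with per-item quantities.
baskets: List[Transaction] = [
    {"bread": 1, "milk": 2},
    {"bread": 4, "milk": 6},
    {"bread": 1},
    {"eggs": 1},
]

pair = frozenset({"bread", "milk"})
print("support:", support(pair, baskets))
print("quantity share:", quantity_share(pair, baskets))
```

In this sample the pair appears in only half of the baskets, yet it accounts for most of the units sold, which is the kind of difference a quantity-aware measure is meant to expose and that a purely count-based support measure hides.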
Related resources
Mining Association Rules from Market Basket Data using Share Measures and Characterized Itemsets
We propose the share-confidence framework for knowledge discovery from databases which addresses the problem of mining characterized association rules from market basket data (i.e., itemsets). Our goal is to not only discover the buying patterns of customers, but also to discover customer profiles by partitioning customers into distinct classes. We present a new algorithm for classifying itemsets...
RDB-MINER: A SQL-Based Algorithm for Mining True Relational Databases
Traditionally, research in the area of frequent itemset mining has focused on mining market basket data. Several algorithms and techniques have been introduced in the literature for mining data represented in basket data format. The primary objective of these algorithms has been to improve the performance of the mining process. Unlike basket data representation, no algorithms exist for mining f...
Algorithms for Association Rules
Association rules are "if-then rules" with two measures which quantify the support and confidence of the rule for a given data set. Having their origin in market basket analysis, association rules are now one of the most popular tools in data mining. This popularity is to a large part due to the availability of efficient algorithms following from the development of the Apriori algorithm. We wil...
Quantity Values in Association Rule Mining Using P-Trees
Association Rule Mining (ARM) in Market Basket Research (MBR) is most commonly used on binary data in databases (the customer bought/didn’t_buy values for each item). There are ARM techniques that address quantity data (how many did the customer buy) but using quantity data presents problems with storage and processing time. Peano Tree technology can respond to both types of problems because of...
User centric approach to itemset utility mining in Market Basket Analysis
Business intelligence is information about a company's past performance that is used to help predict the company's future performance. It can reveal emerging trends from which the company might profit [31]. Data mining allows users to sift through the enormous amount of information available in data warehouses; it is from this sifting process that business intelligence gems may be found [31]. W...